Learning from narrated instruction videos
Abstract
Automatic assistants could guide a person or a robot in performing new tasks, such as changing a car tire or repotting a plant. Creating such assistants, however, is non-trivial and requires understanding the visual and verbal content of a video. Towards this goal, we address the problem of automatically learning the main steps of a task from a set of narrated instruction videos. We develop a new unsupervised learning approach that takes advantage of the complementary nature of the input video and the associated narration. The method sequentially clusters textual and visual representations of a task, where the two clustering problems are linked by joint constraints to obtain a single coherent sequence of steps in both modalities. To evaluate our method, we collect and annotate a new challenging dataset of real-world instruction videos from the Internet. The dataset contains videos for five different tasks with complex interactions between people and objects, captured in a variety of indoor and outdoor settings. We experimentally demonstrate that the proposed method can automatically discover, learn and localize the main steps of a task in input videos.
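The idea of recovering "a single coherent sequence of steps" can be illustrated with a small toy example. The sketch below is not the authors' algorithm; it only shows, under stated assumptions, how an ordering constraint changes step localization. It assumes a hypothetical score matrix `scores[t][k]` (affinity between time interval `t` and step `k`), assumes each step occurs exactly once, and assumes steps appear in a fixed order. The function name `localize_ordered_steps` is made up for illustration.

```python
from typing import List, Sequence


def localize_ordered_steps(scores: Sequence[Sequence[float]]) -> List[int]:
    """Pick one time interval per step so that steps appear in order.

    scores[t][k] is a hypothetical affinity between interval t and step k.
    Returns times[k], strictly increasing in k, maximizing the total score.
    """
    T = len(scores)
    K = len(scores[0]) if T else 0
    if K == 0 or T < K:
        raise ValueError("need at least as many intervals as steps")

    NEG = float("-inf")
    # best[k][t]: best total score with steps 0..k placed and step k at time t
    best = [[NEG] * T for _ in range(K)]
    back = [[-1] * T for _ in range(K)]

    for t in range(T):
        best[0][t] = scores[t][0]

    for k in range(1, K):
        run_best, run_arg = NEG, -1
        for t in range(T):
            # run_best is the best placement of step k-1 at any time < t
            if run_arg >= 0:
                best[k][t] = run_best + scores[t][k]
                back[k][t] = run_arg
            if best[k - 1][t] > run_best:
                run_best, run_arg = best[k - 1][t], t

    # Backtrack from the best position of the last step
    last = max(range(T), key=lambda t: best[K - 1][t])
    times = [0] * K
    times[K - 1] = last
    for k in range(K - 1, 0, -1):
        times[k - 1] = back[k][times[k]]
    return times


if __name__ == "__main__":
    # Independent per-step argmax would put step 1 before step 0 here;
    # the ordering constraint forces a consistent sequence instead.
    toy = [
        [0.1, 0.9],
        [0.8, 0.2],
        [0.0, 0.3],
    ]
    print(localize_ordered_steps(toy))
```

In the toy matrix, choosing the best interval for each step independently would place the second step before the first. The dynamic program trades a little score on one step to keep the sequence consistent, which is the kind of coupling the abstract refers to when it links the two clustering problems with joint constraints.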
Similar resources
Revealing the Programming Process – Using Videos to Unfold Basic Programming Techniques
The most important part of an introductory programming course is to teach the students a systematic approach to the development of computer programs. Revealing the programming process is an important part of this; however, textbooks do not address the issue – probably because the medium is unsuitable for this kind of descriptions. We have found that videos in the form of narrated programming se...
Flipped Classroom Strategies Using Online Videos
The basic principles of a flipped classroom teaching method are to deliver instruction online and to move active learning into the classroom. There are many strategies for delivering the instruction online, such as, preparing online lectures by the course instructors, wrapping the course around a MOOC, and collecting online videos from various sources. There are also many strategies for includi...
The Effect of Colligational Corpus-based Instruction on Enhancing the Pragmalinguistic Knowledge of Request Speech Act among Iranian Intermediate EFL Learners
This study investigated the effectiveness of colligational corpus-based instruction on enhancing the pragmalinguistic knowledge of speech act of request among Iranian intermediate EFL learners. The objective of the study was to find out whether or not providing students with corpora through using colligational instruction had any significant effects on enhancing their pragmalinguistic knowledge...
Evaluation of Midwifery Student's Attitude, Performance and Satisfaction from teaching clinical skills with the Video in Hamedan School of Nursing and Midwifery (2019)
1. Duncan I, Yarwood-Ross L, Haigh C. YouTube as a source of clinical skills education. Nurse Education Today. 2013; 33(12): 1576–1580. 2. Arguel ., Jamet E. Using video and static pictures to improve learning of procedural contents. Comput. Hum. Behav. 2008; 25(2): 354–359. 3. Johnson N, List-Ivankovic J, Eboh W, Ireland ., Adams D, Mowatt E, Martindale S. Research and evidence based pra...
Journal title:
- IEEE Transactions on Pattern Analysis and Machine Intelligence
Publication date: 2017